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Abstract 

Let P = {pi,p2, ■ ■ ■ ,Pn) be a sequence of points in the plane, where Pi = (xj, j/j) and 
xi < X2 < ■ ■ ■ < xn- a famous 1935 Erdos-Szekeres theorem asserts that every such P 
contains a monotone subsequence 5* of \\/N^ points. Another, equally famous theorem 
from the same paper imphes that every such P contains a convex or concave subsequence 
of ^{}ogN) points. 

Monotonicity is a property determined by pairs of points, and convexity concerns 
triples of points. We propose a generalization making both of these theorems members 
of an infinite family of Ramsey-type results. First we define & {k + l)-tuple C P to be 
positive if it lies on the graph of a function whose fcth derivative is everywhere nonncgative, 
and similarly for a negative {k + l)-tuple. Then we say that 5* C P is kth-order monotone 
if its {k + l)-tuples are all positive or all negative. 

We investigate quantitative bound for the corresponding Ramsey- type result (i.e., 
how large fcth-order monotone subsequence can be guaranteed in every iV-point P). We 
obtain an r2(log*^'^^^^ TV) lower bound ((fc— l)-times iterated logarithm). This is based on a 
quantitative Ramsey-type theorem for transitive colorings of the complete (fc -I- 1) -uniform 
hypergraph (these were recently considered by Pach, Fox, Sudakov, and Suk). 

For A; = 3, we construct a geometric example providing an O(loglog A^) upper bound, 
tight up to a multiplicative constant. As a consequence, we obtain similar upper bounds 
for a Ramsey- type theorem for order-type homogeneous subsets in M^, as well as for a 
Ramsey-type theorem for hypcrplanes in recently used by Dujmovic and Langerman. 



1 Introduction 

In this paper we mainly consider sets P = {pi,P2, ■ ■ ■ ,Pn} of points in the plane, where 
Pi = {xi,yi). We always assume that no two of the x-coordinates coincide, and unless stated 
otherwise, we also assume that the pi are numbered so that Xi < X2 < ■ ■ ■ < xn (the same 
also applies to subsets of P, which we will enumerate in the order of increasing x-coordinates) . 

Two theorems of Erdos and Szekeres. Among simple results in combinatorics, only 
few can compete with the following one in beauty and usefulness: 
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Theorem 1.1 (Erdos— Szekeres on monotone subsequences [ES35]) For every positive 
integer n, among every N = (n — 1)^ + 1 points pi, ■ ■ ■ ,Pn £ cls above, one can always 
choose a monotone subset of at least n points, i.e., indices ii < i2 < ■ ■ ■ < in such that either 
Vh <yi2 <■■■ < Vin or yi^>yi2>---> yi„ ■ 

See, for example, Steele [Ste95] for a collection of six nice proofs and some applications. 
For many purposes, it is more natural to view the above theorem as a purely combinatorial 
result about permutations, but here we prefer the geometric formulation (which is also similar 
to the one in the original Erdos— Szekeres paper). 

Another result of the same paper of Erdos and Szekeres is the following well-known gem 
in discrete geometry:^ 

Theorem 1.2 (Erdos Szekeres on convex/concave configurations [ES35]) For every 
positive integer n, among every N = (^'^12) + 1 ~ 4"/-y/n points pi, . . . .p^ G as above, 
one can always choose a convex configuration or a concave configuration of n points, i.e., 
indices ii < 12 < ■ ■ ■ < in such that the slopes of the segments Pi^Pi^^^, j = 1,2, . . . ,n—l, are 
either monotone nondecreasing or monotone nonincreasing. 

See, e.g., [MSOO, Mat02] for proofs and surveys of developments around this result. 

fc-general position. To simplify our forthcoming discussion, at some places it will be conve- 
nient to assume that the considered point sets are in a "sufficiently general" position. Namely, 

we define a set P to be in k-general position if no /c + 1 points of P lie on the graph of a 
polynomial of degree at most k — 1. In particular, 1-general position requires that no two 
y-coordinates coincide, and 2-general position means the usual general position, i.e., no three 
points collinear. 

feth-order monotone subsets. Here we propose a view of Theorems 1.1 and 1.2 as the 

first two members in an infinite sequence of Ramsey-type results about planar point sets.^ 

In Theorem 1.1, monotonicity of a subset is a property of pairs of points of the subset, and 
actually, it suffices to look at pairs of consecutive points. Similarly, convexity or concavity 
of a configuration in Theorem 1.2 is a property of triples, and again it is enough to look at 
consecutive triples. 

In the former case, we are considering the slope of the segment determined by a pair of 
points, which can be thought of as the first derivative. In the latter case, a triple is convex 
iff its points lie on the graph of a smooth convex function, i.e., one with nonnegative second 
derivative everywhere. 

With this point of view, it is natural to define a {k + l)-tuple X C P to be positive if it lies 
on the graph of a function whose k-th derivative (exists and) is everywhere nonnegative, and 
similarly for a negative {k + l)-tuple (in Section 2, we will provide several other, equivalent 
characterizations of these properties). Then we say that an arbitrary subset S C. P is kth- 
order monotone if its [k + l)-tuples are all positive or all negative. 

First-order monotonicity is obviously equivalent to monotonicity as in Theorem 1.1, and 
second-order monotonicity is equivalent to convexity/concavity as in Theorem 1.2. We will 

^Somewhat unfortunately, the name Erdos-Szekeres theorem refers to Theorem 1.1 in some sources and to 
Theorem 1.2 or similar statements in other sources. 

^There is also a (trivial) 0th member, namely, the statement that in every P, at least half of the points 
either have all j/-coordinates nonnegative or have or all j/-coordinates nonpositive. 
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also see (Lemma 2.5) that, to certify /cth-order monotonicity, it is enough to consider all 
(k + l)-tuples of consecutive points. 

Let us remark that every (k + l)-tuple K is positive or negative, and moreover, if K is 
in A;-general position, it cannot be both positive and negative (Corollary 2.3). We will write 
sgn(if) = +1 if if is positive and sgn{K) = —1 if if is negative. 

Ramsey's theorem, quantitative bounds, and transitive colorings. Using the just 
mentioned facts, one can immediately derive a Ramsey- type theorem for /cth-order monotone 
subsets from Ramsey's theorem. 

Proposition 1.3 For every k and n there exists N such that every N-point planar set in 
k-general position contains an n-point kth-order monotone subset. 

Proof. We recall Ramsey's theorem (for two colors; see, e.g., Graham, Rothschild, and 
Spencer [GRS90]): for every £ and n there exists N such that for every coloring of the set 
(^) of all ^-element subsets of an A^-element set X there exists an n-element homogeneous 
set Y C X, i.e., a subset in which all ^-tuples have the same color. The smallest N for which 
the claim holds is usually denoted by Re{n). 

In our case, we set X = P and color each {k + l)-tuple K C P with the color sgn(if ) G 
{±1}. Then homogeneous subsets are exactly fcth-order monotone subsets. □ 

Let us denote by ESfe(n) the smallest value of N for which the claim in this proposition 
holds. We have ESi(n) < (n - 1)^ + 1 and ES2(n) < {^.^12) + 1 according to Theorems 1.1 
and 1.2, respectively; moreover, these inequalities actually hold with equality [ES35]. Our 
main goal is to estimate the order of magnitude of ESfc(n) for k > 3. 

The above proof gives ESfc(n) < i?fe+i(n). However, for A; = 1, and most likely for all A;, 
the order of magnitude of Rk+i{n) is much larger than that of ESfc(n). Indeed, considering 
k fixed and n large, the best known lower and upper bounds of ii!jt_|_i(n) are of the form^ 
R2(n) = 2®('^) and, for /c > 2, 

twrfe(ri(n^)) < Rk+\{n) < twik+i{0{n)), 

where the tower function twrfc(x) is defined by twri(x) = x and twrj_|_i(a;) = 2*'^''»(^). It is 
widely believed that the upper bound is essentially the truth. This belief is supported by 
known bounds for more than two colors, where the lower bound for (k + l)-tuples is also a 
tower of height k + 1; see Conlon, Fox, and Sudakov [CFSll] for a recent improvement and 
more detailed overview of the known bounds. 

The coloring of the (A; + l)-tuples in the above proof of Proposition 1.3 is not arbitrary. In 
particular, it has a property we call transitivity (see Lemma 2.5). Transitive colorings were 
introduced earlier in the recent preprint Fox et al. [FPSSll, Section 6], under the same name. 

To define a transitive coloring in general, we need to consider a hypergraph whose vertex 
set is linearly ordered; w.l.o.g. we can identify it with the set [N] := {1, 2, . . . , A''}. A 
coloring c: {^f) ^ H is transitive if, for every ii, . . . , ii^i G [A^], ii < ■ ■ ■ < i^+i, whenever 
the ^-tuples {ii, ■ ■ ■ ,ie} and {^2, . . . , ie+i} have the same color, then all i!-clcmcnt subsets of 

^We employ the usual asymptotic notation for comparing functions: /(n) = 0(gr(n)) means that |/(n)| < 
C\g{n)\ for some C and all n, where C may depend on parameters declared as constants (in our case on k); 
f{n) — Q{g{n)) is equivalent to g{n) = 0(f{n)); and /(n) = Q{g{n)) means that both /(n) = 0{g{n)) and 
fin) = n{g{n)). 
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{ii, . . . have the same color. Let Rf^^^{n) denote the Ramsey number for transitive 

colorings, i.e., the smallest N such that any transitive coloring of the complete uniform 
hypergraph on [N] contains an n-element homogeneous subset. We have the following bound.^ 

Theorem 1.4 For k = 1,2, we have i?^™°®(n) = ESfe(n), and for every fixed k >3, 

ESfc(n) < Rt^^n) < twrfe(0(n)). 

We note that Fox et al. [FPSSll] proved the slightly weaker upper bound R)^!^f{n) < 
twrfc (0(n log n)). 

The proof of Theorem 1.4 is given in Section 3. The inequality ESfe(n) < R^^^Y^{n) is 
clear (since every iV-point set in /c-general position provides a transitive coloring of {j^i))- 
The upper bounds for R^^^^^{n) and R^'^''^^(n) follow by translating the proofs of Theorem 1.1 
and 1.2 to the setting of transitive colorings almost word by word, and they are contained in 
[FPSSll]. The upper bound on R)^^i^{n) is then obtained by induction on k, with A; = 3 as 
the base case, following one of the usual proofs of Ramsey's theorem. 

A set with no large third-order monotone subsets. For k < 2, the numbers ESfe(n) (and 
thus R^j^^^f{n)) arc known exactly. Our perhaps most interesting result is an asymptotically 
matching lower bound for ES3(n). 

Theorem 1.5 For alln>2 we have R^i^'^^{2n+l) > ES3(2n+l) > 22""' + 1. Consequently, 
ES3(n) = 22''^"\ 

The proof is given in Section 4. A Ramsey function with known doubly exponential 
growth seems to be rare in geometric Ramsey-type problems (a notable example is a result 
of Valtr [Val04]). 

Order types. Here we change the setting from the plane to and we consider an ordered 
sequence P = {pi,p2, - ■ ■ ,Pn) in l^'^- This time we do not assume the first coordinates to be 
increasing. For simplicity, we assume P to be in general position, which now means that no 
d + 1 points of P lie on a common hyperplane. 

We recall that order type of P specifies the orientation of every {d + l)-tuplc of points 
of P, and it this way, it describes purely combinatorially many of the geometric properties 
of P. More formally, the order type of P is the mapping — >■ {— 1,-|-1}, where for a 
{d + l)-tuple / = {ii, . . .,id+i}, h < i2 < ■■■ < id+i, x{I) ■= sgn det M{pi-^,pi^, . . . ,Pi^^-,), 
where M(gfi, . . . , Qd+i) is the {d+1) x {d+\) matrix whose jth column is (1, qj), i.e., 1 followed 
by the vector of the d coordinates of qj. See, e.g., Goodman and Pollack [GP93] or [Mat02] 
for more background about order types. 

From Ramsey's theorem for [d + l)-tuples, we can immediately derive a Ramsey-type 
result for order types: for every d and n there exists N such that every A-point sequence 
contains an n-point subsequence in which all the (d + l)-tuples have the same orientation 
(we call such a subsequence order-type homogeneous). Let us write OTrf(n) for the smallest 
such A. 

In Section 5 we first observe that, by simple and probably well known considerations, 
OTi(n) = {n-lf+l and 0T2(n) = 2®("). For d > 3, the best upper bound for OTrf(n) we are 

*By inspecting the proof of the next theorem, it is easy to verify that the transitivity condition is not used 
in full strength — it suflBces to assume only that the subsets obtained by omitting one of 12, is have the same 
color. 
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aware of is the one from the Ramsey argimicnt above, i.e., OTd{n) < Rd+i{n) < twr^+i (0(n)). 
In particular, for 0X3(71) this upper bound is triply exponential; in Section 5 we prove a doubly 
exponential lower bound. 

Proposition 1.6 For all d and n, OT(i(n) > ES(i(n). In particular, OT^{n) = 2^"^"'. 

A Ramsey-type result for hyperplanes. Let us consider a finite set H of hyper planes in 
■^d general position (every d intersecting at a single point). Let us say that H is one-sided 
if V{H), the vertex set of the arrangement of H, lies completely on one side of the coordinate 
hypcrplanc = 0. 

Let OSH£/(n) be the smallest N such that every set H of N hyperplanes in R'^ in general 
position contains a one-sided subset of n hyperplanes. Ramsey's theorem for d-tuples imme- 
diately gives OSHrf(n) < Rd{n) (a d-tuple gets color if its intersection has a positive last 
coordinate, and color —1 otherwise). 

Matousek and Welzl [MW92] observed that, actually, 0SH2(n) = ESi(n) = (n - 1)^ + 1, 
and applied this in a range-searching algorithm. Recently Dujmovic and Langerman [DLll] 
used the existence of OSH(i(n) (essentially Lemma 9 in the arXiv version of their paper) 
to prove several interesting results, such as a ham-sandwich and centerpoint theorems for 
hyperplanes. 

In Section 5 we show that lower bounds for fcth-order monotone subsets in the plane can 
be translated into lower bounds for OSH^. 

Proposition 1.7 We have OSH(i(ri) > ESd_i(n), and in particular, 0SH3(n) = 2^^"'^ and^ 
0SH4(n) = 22"<"\ 

The lower bounds for OSH(i(n) can also be translated into lower bounds in the theorems of 
Dujmovic and Langerman. For example, in their ham-sandwich theorem, we have d collections 
Hi,. . . ,Hd of hyperplanes in W^, each of size N, and we want a hyperplane g such that in 
each Hi, we can find disjoint subsets Ai, Bi of n hyperplanes each such V{Ai) lies on one side 
of g and V{Bi) on the other side. 

To derive a lower bound for the smallest necessary A*", we fix d affinely independent points 
pi, . . . ,Pd in the Xd = hyperplane, and a set H oi N hyperplanes in general position with 
no one-sided subset of size n. We let Hi be an affinely transformed copy of H such that all 
of V{Hi) lies very close to pi. Then every potential ham-sandwich hyperplane g for these Hi 
has to be almost parallel to the Xd = hyperplane, and thus there cannot be Ai, Bi of size n 
for all i. 

The work of Fox et al. While preparing a draft of the present paper, we learned about 
a recent preprint of Fox, Pach, Sudakov, and Suk [FPSSll]. They investigated various com- 
binatorial and geometric problems inspired by Theorems 1.1 and 1.2, and as was mentioned 
above, among others, they introduced transitive colorings,^ but mainly they studied a re- 
lated but different Ramsey-type quantity: let N^{q,n) be the smallest integer such that, 
for every coloring of ('^') with q colors, there exists an n-element / = {ii,...,in} Q [N], 

^An exponential lower bound for OSH3 was known to the authors of [MW92], and perhaps to others as 
well, but as far as we know, it hasn't appeared in print. 

^With still another geometric source of such colorings besides the Erdos-Szekeres theorems, namely, non- 
crossing convex bodies in the plane 
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ii < ■ ■ ■ < in, inducing a monochromatic monotone path, i.e., such that all the ^-tuples of the 
form {ij, . . . , j = 1,2, . . . ,n — i + 1, have the same color. 

They note that R^^^^in) < Ni[2, n), and they obtained the following bounds for N(_{2,n): 
N2{2,n) = ESi(n), iV3(2,n) = ES2(n), and for every fixed k > 3, 

twrk{n{n)) < Nk+i{2,n) < twrfe(0(nlogn)). 

As we mentioned after Theorem 1.4, this also yields an upper bound for only slightly 

weaker than the one in that theorem. 

Open problems. 

1. We have obtained reasonably tight bounds for £83(71), but the gaps are much more 
significant for ESfc(n) with /c > 4. According to the cases k = 1,2,3, one may guess 
that ESfc(n) is of order twrfc(0(n)), and thus that stronger lower bounds are needed, 
but a possibility of a better upper bound shouldn't also be overlooked. This question 
looks both interesting and challenging. 

2. A perhaps more manageable task might be a better lower bound for R]i^^'^^{n), k > 4. 
A natural approach would be to imitate the Stepping-Up Lemma used for lower bounds 
for the Ramsey numbers Rk{n) (see, e.g., [CFSll]). But so far we have not succeeded in 
this, since even if we start with a transitive coloring of fc-tuples, we could not guarantee 
transitivity for the coloring of {k + l)-tuples. 

3. As for order-type homogeneous sequences, for 0T3(n) we have the lower bound of 2^"*"'' , 
but upper bound only twr4(0(n)) directly from Ramsey's theorem. It seems that the 
colorings given by the order type are not transitive in any reasonable sense, and we 
have no good guess of which of the upper and lower bounds should be closer to the 
truth. Similar comments apply to the problem with one-sided subsets of planes in 
(concerning 0SH3(n)), and the higher-dimensional cases are even more widely open. 

4. Another interesting question is whether n log n can be replaced by n in the upper bound 
for the quantity N^{2,n) considered by Fox et al. [FPSSll]. 

5. In our definition of fcth-order positivity, every [k + l)-tuple of points should lie on the 
graph of a function with a nonnegative feth derivative, and different functions can be 
used for different (A;-|-l)-tuples. In an earlier version of this paper, we conjectured that, 
assuming /c-general position, a single function should suffice for all {k + l)-tuples; in 
other words, that every fcth-order monotone finite set finite set in fc-general position 
lies on a graph of a fc-times differentiable function / : M — )■ M whose fcth derivative is 
everywhere nonnegative or everywhere nonpositive. 

However, Rote [Rot 12] disproved this for fc = 3 (while the cases fc = 1, 2 do hold, as is 
not hard to check). With his kind permission, we reproduce his example at the end of 
Section 2. 

Naturally, this opens up interesting new questions: How can one characterize point sets 
lying on the graph of a function whose fcth derivative is positive everywhere? Is there a 
Ramsey-type theorem for such sets, and if yes, how large is the corresponding Ramsey 
function? 
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2 On the definition of fcth-order monotonicity 

Here we provide several equivalent characterizations of fetli-order monotonicity of planar point 
sets and some of their properties. First we recall several known results. 

Divided differences and Newton's interpolation. Let pi,p2,. ■ ■ ,Pk+i be points in the 

plane, pi = {xi,yi), where the Xi are all distinct (but not necessarily increasing). We recall 
that the kth divided difference Ak{pi,p2, ■ ■ ■ ,Pk+i) is defined recursively as follows: 

^o(Pi) := yi 

^k-l{P2,P3, ■ ■ ■ ,Pk+l) - Afc_l(pi,P2, ■ • . ,Pk) 



^kiPl,P2,---,Pk+l) ■ = 



Xk+l - Xl 



For example, Ai(pi,p2) equals the slope of the line piP2- In general, the kth divided difference 
is related to the A;th derivative as follows (see, e.g., [Phi03, Eq. 1.33]; note that the case k = 1 
is the Mean Value Theorem): 

Lemma 2.1 (Cauchy) Let the points pi, . . . ,Pk+i, a := xi < X2 < ■ ■ ■ < b := Xk+i, lie on 
the graph of a function f such that the kth derivative f^^^ exists everywhere on the interval 
(a, h) . Then there exists ^ G (a, h) such that 

Afc(pl, . . . ,pk+l) = — — ■ 

We will also need the following result (see, e.g., [PhiOS, Eq. 1.11-1.19]). 

Lemma 2.2 (Newton's interpolation) Let p\, . . . ^pk+i G he points with distinct x- 
coordinates (here we need not assume that the x-coordinates are increasing) . Then the unique 
polynomial f of degree at most k whose graph contains pi,. ■ ■ ,Pk+i is given by 

k+l / i-1 



i=l ^ j=l 

In particular, the coefficient of x^ is A(pi, . . . ,pfc_|_i), and it equals f^^\x)/k\ (which is a 
constant function). 

We recall that a (A; + l)-tuple K = {pi, ■ ■ ■ ,Pk+i} was defined to be positive if it is 
contained in the graph of a function having a nonnegative A;th derivative everywhere. We 
obtain the following equivalent characterization: 

Corollary 2.3 ^4 (A: + l)-tuple K = {pi, . . . ,Pk+i\ is positive iff A^^pi, . . . ,Pk+i) > (and 
similarly for a negative {k + l)-tuple). If K is in k-general position, we have sgnK = 
sgnAfc(pi,...,pfc+i). 

Proof. If K is contained in the graph of / with f^'^^ > everywhere, then Afe(pi, . . . ,Pk+i) > 
by Lemma 2.1. 

Conversely, if Ak{pi, ■ ■ ■ ,Pk+i) > 0) then by Lemma 2.2, the unique polynomial of degree 
at most k whose graph contains K is the required function with nonnegative fcth derivative. 

If, moreover, K is in fc-general position, then Ak{pi, ■ ■ ■ ,Pk+i) 7^ 0, and so K cannot be 
both /cth-order positive and A;th-order negative by Lemma 2.1. □ 

We will also need the following criterion for the sign of a (fc + l)-tuple. 
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Lemma 2.4 Let K = {pi,P2, ■ ■ ■ ,Pk+i} he a {k + 1) -tuple of points in k-general position, 
xi < ■ ■ ■ < a^fc+i, let i £ [k + 1], and let fi he the (unique) polynomial of degree at most k — 1 
whose graph passes through the points of K \ {pi}. Then sgni^ = (—1)'^"* if pi lies below the 
graph of fi, and sgnK = (—1 if lies above the graph. 

Let / be the polynomial of degree at most k passing through all of K. We use Newton's 
interpolation (Lemma 2.2), but with the points reordered so that pi comes last, and we get 
that 

f{x) = fi{x) + Ak{pi,...,Pi-i,Pi+i,...,Pk+i,Pi) Yi i^-^j)- 

je[k+i]\{i} 

Using this with x = Xi, we get 

sgn.{yi - fi{xi)) = sgn{f{xi)-fi{xi)) 

= sgn Ak{pi,..., Pi-i,Pi+i,...,Pk+i, Pi) -sgn JJ (xi-Xj). 

je[k+i]\{i} 

Divided differences are invariant under permutations of the points (as can be seen, e.g., 
from Lemma 2.2, since the interpolating polynomial does not depend on the order of the 
points), and so sgn Afe(pi, . . . . . .,pk+i,Pi) = sgnK. Finally, the product njg[fc+i]\{j}(a;j- 

Xj) has k + 1 — i negative factors, thus its sign is (— l)*^"*"-^"*, and the lemma follows. □ 

It remains to prove transitivity. 

Lemma 2.5 Let P = {pi, . . . ,pn} he a point set in k-general position. Then the 2-coloring 
of {k + l)-tuples K G (fe^i) hy their sign is transitive. 

Proof. We consider a (A;+2)-tuple L = {pi, . . . ,pk+2] with sgn{pi, . . . ,pk+i] = sgn{p2, • • ■ ,Pk+2] = 
+1, and we fix i G {2, . . . , A; + 1}. Let fi^k+2 be the polynomial of degree at most k — 1 passing 
through L\{pi,pk+2], and similarly for /i,fc+2- Our goal is to show that fi,k+2ixk+2) < yk+2, 
since this gives sgn(L \ {pi}) = +1 by Lemma 2.4. 

Since sgn(L \ {pi}) = +1, we have fi,k+2{xk+2) < yk+2 (Lemma 2.4 again), and so it 
suffices to prove fi,k+2{xk+2) < fi,k+2{xk+2)- 

Let us consider the polynomial g := fi^k+2 ~ fi,k+2i as explained above, our goal is proving 
sgn g{xk-^2) = +1- To this end, we first determine sgng(xi): We have fi,k+2{xi) = yi and 
sgn(yi — fi k^2{xi)) = (—1)'^ (using sgn(L \ {pi}) = +1 and Lemma 2.4). Hence sgn5r(xi) = 
(-1)^=-!. 

Next, we observe that 5 is a polynomial of degree at most k — 1, and it vanishes at 
X2, . . . , Xi-i,Xi-\-i, . . . , Xk+i. These are k — 1 distinct values; thus, they include all roots of g, 
and each of them is a simple root. Consequently, g changes sign {k — l)-times between xi 
and Xk+2- Hence, finally, sgng{xk+2) = (— l)'^"^ sgng'(a;i) = +1 as claimed. □ 

Rote's example. Fig. 1 shows a 6-point set P = {pi, . . . ,pq} in 3-general position (no 
four points on a parabola). It is easy to check 3rd-order positivity using Lemma 2.4: By 
transitivity, it suffices to look at 4-tuplcs of consecutive points. For Pi,..-,P4 we use the 
parabola through ^1,^2,^3 (which actually degenerates to the a;-axis); for p2, • • • ,P5 we use 
the dashed parabola through P2,P3, Pi (which is very close to the x-axis in the relevant region) ; 
and for p3, . . . ,p6) the parabola through P4,P5,PG (drawn full). 
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Figure 1: Rote's example: a 6-point 3rd-ordcr positive set in 3-general position that does 
not lie on the graph of any function with nonnegative 3rd derivative. 

It remains to check that P does not lie on the graph of a function / with /^'^^ > 
everywhere. Assuming for contradiction that there is such an /, we consider the point q := 
{xq, /(xq)), where xq is such that the full parabola is below the x-axis at xq. For the 4-tuple 
{p1jP2,P3,q} to be positive, q has to lie above the x-axis, but the 4-tuple {q,P4:,P5,Pe} is 
positive only if q lies below the parabola through P4,P5,P6 — a contradiction. 

3 Upper bounds on the Ramsey numbers for transitive color- 
ings 

In this section we prove Theorem 1.4. As we mentioned in the remark following that theorem, 
it suffices to establish the case A; > 3. 

Thus, we want to prove that i?j5^^(n) < twrfe(Cjtn) for all n and for every k > 3, with 
suitable constants Ck depending on k. As the base of the induction we use R^'^^^{n) < 4", 
which, as was remarked earlier, follows by imitating the proof of Theorem 1.2. 

Thus, let A; > 3 be fixed, let n be given, and let us set M := We will prove that 

R'k^Tin) <N ■.= 2^'. (1) 

Theorem 1.4 then follows from this recurrence and from the fact that 2*^'^'=-i("')'° < twr f;{kn) 
for k > 3, which is easy to check. 

To prove (1), we follow an inductive proofs of Ramsey's theorem going back to Erdos 
and Rado [ER52]. Let x- (l+i) ~^ be an arbitrary transitive 2-coloring. We set 

Ak-i := {1, 2, . . . , A; - 1} and Xk^i := [N] \ Ak-i- For i = k,k + 1, . . . , M we will inductively 
construct sets Ai,Xi C [N] such that 

(i) Ai < Xi (i.e., all elements of Ai precede all elements of Xi); 

(ii) l^il = i and \Xi\ > \Xi^i\/2^'''' ; and 

(iii) the color of a (A; + l)-tuple whose first k elements all belong to Ai does not depend on 
its last element; in other words, for K G {^) and x,y & AiU X^ with K < {x,y}, we 
have x{K U {x}) = x{K U {y}). 

For the inductive step, suppose that Ai and X^ have already been constructed. We let Xj 
be the smallest element of Xi, we set Aj+i := Ai U {xj}, and wc write X'^ := Xi \ {xj}. 

Let us call two elements x,y € X[ equivalent if we have, for every K G (^'Ji)) U 
{xi,x}) = x(iiru {xj, y}). There are possible choices of K, and hence there are at most 
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< 2^'^ ^ equivalence classes. We choose Xj+i C X[ as the largest equivalence class. 
Then (i), (iii) obviously hold for Aj+i and Xj+i, and we have \Xi+i\ > {\X^\-l)/{2^""'-l) > 
|Xi|/2*^'"' (since i < M and thus wc have \Xi\ > N/{2^''-^y-^ = 2^'-(^-i)^'"' > 2^*"'). 
This finishes the inductive construction of Ai and Xi. 

In this way, we construct the sets A := Am and Xm (note that \Xm\ > 1 by (ii)). Let x 
be the first element of Xm, and let us define a 2-coloring x* '■ (^) ~^ {!> 2} of the /c-tuples of 
Ahyx*{K) :=x{KU{x}). 

We claim that, crucially, x* is transitive (which is not entirely obvious). So wc consider ele- 
ments oi < 0,2 < • • • < Ofc+i of A, and we suppose that X*({<^1) ■ ■ ■ , o-k}) = X*{{'^2, ■ ■ ■ , Ofc+i}) 
c. We want to show that x*({ai, . . . , ak+i} \ {ai}) = c for every i = 2,S, . . . ,k. We have 
c = X*({ai, • • • ,afc}) = • • ■,ak,x}) = xi{ai, ■ ■ .,ak+i}) (by definition and by the inde- 

pendence of X of the last element), and c = X*({fl2, • • • , Ofc+i}) = x{{a2, ■ ■ ■ , a,k+i,x}). Next 
we use the transitivity of x on the (/c+2)-tuple (ai, . . . , a^+i, x), obtaining x({<^i) • • • > o^fc+i) 
{ai}) = c = x*({ai, . . . , ajfc+i} \ {ai}) as needed. 

Now we can apply the inductive hypothesis to A, which yields an n-element subset of A 
homogeneous w.r.t. x*, and this subset is homogeneous w.r.t. x as well, finishing the proof of 
Theorem 1.4. □ 



4 A lower bound for ES3 

Here we prove Theorem 1.5, a lower bound for ES3(2n -|- 1). We proceed by induction on n; 
the goal is to construct a set Pn of A'' := 2^" points with no (2n -|- l)-point third-order 
monotone subset. The induction starts for n = 2 with an arbitrary P2 of size 2^ = 4. 

In the inductive step, given P„, we will construct Pn+i so that |-Pn+i| = I-Pnpj then the 
bound on the size of P„ clearly holds. 

We may assume that P = P„ is in 3-general position (this can always be achieved by a 
small perturbation). By an affine transformation we also make sure that P C [1,2] x [0, 1]; 
or actually, P C [1,1-9] x [0, 1] so that there is some room for perturbation. Moreover, there 
is a small S > such that if P' is obtained from P by moving each point arbitrarily by at 
most 5, then P' is still in 3-general position, the order of the points of P' along the x-axis 
is the same as that for P, and the sign of every 4-tuple in P' is the same as the sign of the 
corresponding 4-tuple in P. 

The construction. The construction of Pn+i from P = P„ as above proceeds in the following 
steps. 

1. We choose a sufficiently large number A = A{P) (the requirements on it will be specified 
later), and we set e := 1/A'^. 

2. For every point p £ P, let Qp be the image of P under the affine map that sends the 
square [1,2] x [0,1] to the axis-parallel rectangle of width e, height e^, and with the 
lower left corner at p; see Fig. 2. 

3. Let ippix) = Ax"^ + Cp be a quadratic function, where A is as above and Cp is chosen so 
that ipp^xip)) = (where x{p) is the x-coordinate of p). Let Qp be the set obtained by 
"adding ipp to Qp", i.e., by shifting each point {x,y) G Qp vertically upwards by ipp{x). 
We set Pn+i := [jp^pQp- We call the Qp the clusters of Pn+i- 
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Figure 2: A schematic illustration of the construction of Pn+i- 

First we check that each cluster Qp lies close to p. 
Lemma 4.1 Each Qp is contained in an 0{y/e ) -neighborhood of p. 

Proof. Writing p = {xo,yo), the set Qp obviously lies in the 2£-neighborhood of p, and the 
maximum amount by which a point of Qp was translated upwards is at most 

^p(xo + e) = A {{xo + ef - x^) = A{2xQe + e^) = O(v^). 

□ 

Here is a key property of the construction. 

Lemma 4.2 (Slope lemma) Let X be a parabola passing through three points of Pn+i that 
belong to three different clusters, or a line passing through two points of different clusters. Let 
ji be a parabola passing through three points of a single cluster Qp, or a line passing through 
two such points. Then the maximum slope (first derivative) of A on the interval [1,2] is 
smaller than the minimum slope of on [1,2], provided that A was chosen sufficiently large. 

Proof. Clearly, the maximum slope of any such A can be bounded from above by some 
finite number depending only on P but not on A. Thus, it suffices to show that, with A large, 
for every ji as in the lemma, the minimum slope is bounded from below by A. 

First let us assume that /i is a parabola passing through three points of Qp, where p = 
(a^Oj^o)) let fi be the parabola passing through the corresponding three points of P, and let 
the equation of /i be y = ax'^ -\-bx + c. 

By the construction of Qp, the affine map transforming P to Qp sends a point with 
coordinates (x, y) to the point {e{x — 1) + xq, e^y + yo). Calculation shows that the image of 
ft under this affine map has the equation y = ax"^ + (2ae + be — 2axQ)x + c' , where the value 
of the absolute term c' need not be calculated since it doesn't matter. Hence the minimum 
slope of this curve on [1, 2] is bounded from below by — (8|a| + 4|a|£ + 2|6|£ + 8|a|). Finally, ji 
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Type 3 + 1 Type 1 + 1 + 2 Type 1 + 2 + 1 Type 2 + 2 



Figure 3: Determining the signs of 4-tuples by type. 

is obtained by adding il^p{x) = Ax^ + Cp to this curve, and the minimum slope of V'p on [1, 2] 

is at least 2 A. 

Next, let be a line passing through two points g, r G Qp. Let us choose another point 
s ^ Qp and consider the parabola ji' through g, r, s. By the Mean Value Theorem, the slope 
of /X equals the slope of ji' at some point between q and r, and the latter is at least A by the 
above. The lemma is proved. □ 

Let K = {pi,P2,P3,P4} C P„+i be a 4-tuple, pi = {xi, yi), xi < ■ ■ ■ < X4. We assign a type 
to K, which is an ordered partition of 4 given by the distribution of K among the clusters; 
for example, K has type 1 + 1 + 2 if the first point pi lies in some Qp, p2 lies in Qp/ for p' / p, 

and P3,P4& Qp", p" ^P,p'- 

The next lemma shows that the sign K is determined by its type. We provide a complete 
classification, although we will not use all of the types in the subsequent proof. 

Lemma 4.3 Let K = {pi,P2,Pz,P4} C Pn+i he a 4-tuple. If K is of type 1 + 1 + 1 + 1 or 4, 
then the sign of K is the same as that of the corresponding A-tuple in P. Otherwise, the sign 
of K is determined by its type as follows: 

• for types 3 + 1 and 1 + 3 it is — 1; 

• for types 1 + 1 + 2 and 2 + 1 + 1 it is +1; 

• for type 1 + 2 + 1 it is —1; and 

• for type 2 + 2 it is +1. 

Proof. Since the transformation that converts P into Qp preserves the types of 4-tuples, 
the statement for type 4 is clear. The statement for type 1 + 1 + 1 + 1 follows since, by 
Lemma 4.1, K is obtained by a sufficiently small perturbation of the corresponding 4-tuple 
in P (this gives one of the lower bounds on A, since we need the bound in Lemma 4.1 to be 
smaller than the 5 considered at the beginning of our description of the construction). 

The statements for the remaining types are obtained by simple application of the slope 
lemma (Lemma 4.2) together with Lemma 2.4. Namely, for type 3 + 1, we get that the 
parabola through pi,P2,P3 lies above p4 (by comparing its slope to the slope of the line P3P4); 
see Fig. 3. For type 1 + 3 we similarly get that pi lies above the parabola through P2,P3,P4, 
and so the sign is —1 in both of these cases. 

For type 1 + 1 + 2, the segment P3P4 is steeper than the parabola through P1P2P3, and so 
the sign is +1. Similarly for type 2 + 1 + 1 we get that pi lies below the parabola through 
P2,P3,P4, which again gives sign +1. For type 1 + 2 + 1, p3 lies above the parabola through 
pi,P2,P4, giving sign —1. Finally, for type 2 + 2, the segment pip2 is steeper than P2P3, thus 
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the parabola through pi,P2,P3 is concave, and hence its slope at and after it is no larger 
than the slope of the segment P2P3- Thus, p4 lies above this parabola and the sign is +1 as 
claimed. □ 



Finishing the proof of Theorem 1.5. It remains to show that Pn+i contains no (2n + 3)- 
point third-order monotone subset. 

For contradiction, suppose that M C P„+i is such a (2n + 3)-point subset. Let 2n + 3 = 
ni + n2 + • • • + ris be the type of M (i.e., M has nj > 1 points in the ith leftmost cluster it 
intersects). By the inductive assumption we have s < 2n and rii < 2n for all i. 

Let Ha = maxiHi and n;, = maxj^^nj be the two largest among the rij. For convenience, 
let us assume a < b; the case o > 6 is handled symmetrically. We distinguish three cases. 

First, if > 3 and rif, > 2, then wc can select 4-tuples of types 3 + 1 and 2 + 2 from the 
corresponding two clusters, which have different signs, and so M is not homogeneous. 

Second, if > 3 and = 1, then we have at least three rii equal to 1 (since Ua < 2n), 
and at least two of them lie on the same side of the cluster corresponding to Ua, say to the 
right of it. Then we can select 4-tuples of types 3 + 1 and 2 + 1 + 1, again of opposite signs. 

Third, if = 2, then there are at least two other clusters of size 2. From these three 
2-element clusters, we can select 4-tuples of types 2+2 and 1+2+1, again of opposite signs. 

This exhausts all possibilities {ua = 1 cannot happen, because s < 2n), and Theorem 1.5 
is proved. □ 



5 Order types and one-sided sets of hyperplanes 

First we substantiate the two claims made above Proposition 1.6, concerning OTi and OT2. 
For d = 1, an order- type homogeneous sequence in is just a monotone sequence of real 
numbers, so OTi(n) = (n — 1)^ + 1 by Theorem 1.1. 

In a similar spirit, it is easy to check that a planar order-type homogeneous sequence 
corresponds to the vertices of a convex n-gon, enumerated in a clockwise or counterclockwise 
order. Thus, 0T2(n) > ES2(rn/2]) = 2"("). On the other hand, given any A''-point sequence, 
we can first select a subsequence of [VN ] points with increasing or decreasing ^-coordinates, 
and then we select a convex or concave configuration from it. Thus, by Theorem 1.2, we have 
0T2(n) = 20("). 

Proof of Proposition 1.6. For a point p = {x,y) G M^, we define the point p := 

To prove that ESd{n) < OTd{n), we consider a set P = {pi, . . . ,pn} C in d-general 
position. Pi = (xi,yi), where N = ESrf(n) — 1 and xi < • • • < x^, with no dth-order monotone 
subset of n points. It suffices to prove that the sequence P := {pi,P2, ■ ■ ■ ,Pn) has no n-point 
order-type homogeneous subsequence. This follows from the next lemma. 

Lemma 5.1 For every {d+ l)-tuple (pi, . . . of points in E?, xi < • • • < x^+i, we have 

sgn({pi, . . . = sgndetM(pi,^2, • • • where M{qi, . . . ,qd+i) is the matrix from 

the definition of order type above Proposition 1.6. 

Proof. By Lemma 2.2 and Corollary 2.3, the sign of {pi, . . . ,Pd+i\ equals the sign of the 
coefficient of the unique polynomial f{x) = X^j=o ^i^^ degree at most d whose graph 
passes through the points pi, . . . ,Pd+i- 
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The vector a = (ao, • • • , ad) can be expressed as the solution of the Hnear system Va = y, 
where y = {yi, . . . , yd+i) and V is the Vandermonde matrix with d+ 
1. By Cramer's rule, we obtain 

detW 
"'^^ d^' 

where W stands for the matrix V with the last column replaced with the vector y. As is well 
known, det V = ni<i<j<d+i(^j ~ ^«)> since xi < ■ ■ ■ < x^+i, we have det V > 0. Thus, 
sgnorf = sgndetT^. Finally, we have 



W = 



/ 1 XI xl ... xf''^ yi \ 

: : : : : : = M{pi,p2, . . . ,Pd+i)^ ■ 
\ 1 Xd+i x^+i . . . x^;} yd+i J 



The lemma follows, and Proposition 1.6 is proved. □ 

Proof of Proposition 1.7. The proof is very similar to the previous one. This time 

we start with a set P = {pi, . . . ,pn} C in (d — l)-general position, pi = {xi,yi), where 
N = ESd-i(ra) — 1 and xi < • • • < xat, with no {d — l)th-order monotone subset of n points. 
We define a collection H = {hi, . . . ,hi\f} of N hyperplanes in W^, where hi is given by 



= |(6,...,C<i)eM'^:^xr'6=2/.| 
>- j=i ^ 



The intersection point ^ = (^i, • • • , ^d) of, say, hi,. . . ,hd is the solution of the linear system 
= where V is the d x d Vandermonde matrix this time, vij = x^~^. Cramer's rule then 
gives that the dth coordinate ^d, whose sign we are interested in, equals {detW)/{detV), 
where W is obtained from V by replacing the last column with y. 

As we saw in the proof of Proposition 1.6, (detW) / (detV) also expresses the leading 
coefficient in the polynomial of degree d — 1 passing through pi, . . . ,pd, and thus its sign 
equals sgn A(i_i(pi, . . . ,pd)- It follows that one-sided subsets of H precisely correspond to 
{d — l)st-order monotone subsets in P, and the proposition is proved. □ 
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